o 

(N 



O 



On estimation of analytic density function in L 



p 



Natalia Stepanova 
Carleton University, Canada 

October 11, 2011 



Abstract 



Let Xi,X2, ... be a sequence of independent identically distributed random variables 
with an unknown density function / on R. The function / is assumed to belong to a certain 
. class of analytic functions. The problem of estimation of / using Lp-risk, 1 < p < oo, is 

considered. A kernel-type estimator /„ based on Xi, . . . , A„ is proposed and the upper 
bound on its limiting local minimax risk is established. Our result is consistent with a 
^-H ' conjecture of Guerre and Tsybakov (1998) and augments previous work in this area. 

' Keywords: minimax estimation, analytic density, kernel-type estimators 

1 Introduction 

Let Xi,X2, ... be a sequence of i.i.d. random variables with unknown density function /o on 
^ , R. The function /o is assumed to belong to a class of analytic functions. The problem is 

■ to estimate /o using Lp-risk, 1 < p < oo. Specifically, to judge the quality of an estimator 

O . fn{x) = fn{x, Xi, . . . , Xn) of fo{x) we will be using Lp-risk, 1 < p < oo, based on a loss 

function L{x) of the type 

L(x) = l(\\xL), 



where / : [0, oo) — t- M is a function from a general class of loss functions C and for a function 
/ in Lp = Lp(R) 

><; ll/llp= (^l/Wrdi) l<P<oo. 

^ In nonparametric estimation, we only know a prior that an unknown function (such as 

density function, distribution function, or regression function) belongs to some functional 
class. Let / be an estimated function that is known to belong to a functional space J-'. For 
I G C consider the minimax Lp-risk 

7^„(^)=infsupE^/(||/„-/||p), 

where fn is an arbitrary estimator of /. If an estimator /„ of / is such that 

supE^/(||/„-/||p)~7^„(^), 

where the relation a„ ~ 6„ means lim„, _j.oo On/^n = then /„ is called an asymptotically 
efficient (or asymptotically minimax) estimator of / with respect to Lp-risk. After the work 
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of Pinsker (1980) the problem of constructing asymptotically efficient estimators with known 
exact rates, i.e., optimal rates including optimal constants, aroused considerable interest. In 
this regard, we can mention the works of Nussbaum (1983), Ibragimov and Hasminskii (1984), 
Korostelev (1993), Donoho (1994), Golubev, Levit, and Tsybakov (1996), Schipper (1996), 
Guerre and Tsybakov (1998), Levit and Stepanova (2004), etc. 

Sometimes estimated functions admit more precise locally asymptotically efficient estima- 
tors (see, e.g., Golubev and Levit (1996), Belitser (1998)). Specializing to the estimation of the 
density function /q, suppose that for any sufficiently small vicinity V of /o in an appropriate 
topology, an estimator /„ satisfies 

supE/Z(||/„ - f\\p) ~ inf supE//(||/„ - f\\p). (1) 
fev fn /GV 

Then /„ is called a locally asymptotically efficient (or locally asymptotically minimax) estimator 
of /o with respect to Lp-risk. 

Among a variety of functional classes traditionally studied in nonparametric estimation, 
a class of analytic functions plays an important role (see, e.g., Levit and Stepanova (2004), 
pp. 254-255, for discussion and references). As an alternative to classes of functions of finite 
smoothness, analytic functions were first used for the purpose of nonparametric estimation by 
Ibragimov and Hasminskii (1980). For the famous Gaussian white noise model, the problem 
of asymptotically efficient estimation of analytic regression function using Lp-risk, 1 < p < oo, 
was solved by Guerre and Tsybakov (1998) in a univariate case, and by Levit and Stepanova 
(2004) in a multivariate case. At the same time, due to the lack of normality of observations, 
an analogous problem of constructing (locally) asymptotically efficient estimators of analytic 
density remains largely unsolved. A conjecture in Remark 5 of Guerre and Tsybakov (1998), 
which is based on the asymptotics of the stochastic part of the Lp-error of kernel density 
estimators in Csorgo and Horvath (1988), says that for a general class of loss functions I and 
any 2 < p < oo 

hm inf supE/KV'p-'HII/n - /Hp) = Hm suv^fl{^-\n)\\f* - /||p) = Z(l), (2) 

fn feA /gA 

where /* (x) is the kernel density estimator of the form 



nhn \ hn J 'TTt logn 

ipp{n) is the rate function given by (21), and A = A(7, M) is a class of density functions such 
that each / G A admits an analytic continuation to the strip Sy = {x + iy : \y\ < 7} with 
7 > such that f{x + iy) is analytic on the interior of Sj, bounded on and for some M > 

/ \fix + i-f)\'^dx <M. 

The only case when the limiting relation (2) is proved is for p = 2 (see Theorem 2 of Schipper 
(1996)). A partial solution to the conjecture of Guerre and Tsybakov (1998) was recently 
provided by Mason (2009) who showed that for a general class of loss functions I, for each 
2 < p < 00 and / G A(7, M) one has (see Mason (2009), Sec. 3.3) 

limE^ (V';'(n)||/:-/||p)=/(l), 
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where ijjpin) is given by (21) and /^(x) is as in (3). We also know that for any 2 < p < oo (see 
Mason (2009), p. 102) 



hmsup sup E/ ( Vn/i„||/* - /lip) < oo, (4) 

n^oo /GLj,/2(B)nA(7,M) ^ ^ 

where Lp/2(^) is the class of densities / on R such that ||/||p/2 < B for some B > 1. 

The main result of this paper. Theorem 1, augments previous work on this topic. For a 
general class of loss functions I, we construct a kernel- type estimator of analytic density /o 
such that for any 1 < p < oo, cf. (2) and (4), 

lim limsupsupE/l(V^;i(n)||/n - f\\p) < (5) 

V\/o n-^oo /eV 

where ijjp{n) is given by (21) and V is a vicinity of /o in a suitable topology. We expect that 
the lower bound 

lim liminf inf supEyZ(V'-^(n)||/„ - f\\p) > /(I), 

V\/o n->oo ygv 

which jointly with (5) would ensure that our estimator /„ is locally asymptotically efficient, also 
holds. The derivation of the lower bound, however, is a more intricate problem that "requires 
a considerable amount of ingenuity and special techniques" (see Mason (2009), p. 70). We 
shall not treat this problem here. 

The paper is organized as follows. In Section 2 we define the class of analytic functions. In 
Section 3 we introduce the kernel-type estimator of an unknown analytic density function and 
establish its basic properties. The main result of the paper, Theorem 1, is stated in Section 4, 
and its proof is given in Section 5. Auxiliary results and their proofs are collected in Sections 
6 and 7. 



2 An analytic functional class 

2.1 Preliminaries 

In order to construct an appropriate functional class, we start with the class J^ij) of functions 
f{z), z = X + iy E C, such that 

a) / is analytic in the strip = {x + iy : \y\ < 7}; 

b) u{x, y) = Uf{x, y) = Ref{x + iy) is bounded in S'-y; 

c) / is real on the real axis. 

Under these conditions the limits 

u{x) = \\m. u{x^y) = lim u(x,y) 

are known to exist and be equal, for almost all x, and the function f{z) admits the following 
representation 

/oo 
G{z — s)u{s) ds, \lmz\ < 7, (6) 
-00 
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where ^ 

" 27 cosh 1^' 

Moreover, the whole class -^(7) will be obtained in this way by allowing any function u with 
||u||oo < 00 in (6). Sec Akhiezer (1965), Sec. 110, p. 267-268 (without detailed proof); Timan 
(1960), Sec. 3.8.5, p. 150. 

More general analytic classes are obtained by using the following arguments. Note that for 
\y\ < 7, 

1 cosh If COS ^ 

ReG(x + iy) = ^ ^^^^ ^ > 0, 

^ 27 sinh2 1^ + cos2 1^ 

and 

1 sinhffsinf^ 

lmG{x + iy) — — 



27 sinh^ ^ + cos2 ^ ■ 



Thus by denoting 

we get for any \y\ < 7 



Try ■ 
a = cos — and t = smh — , 
27 27' 



ReG{x + iy)dx = / 

-00 J —i 



dt 



a7r((t/a)2 + l) 
or equivalently 

\\ReG{-+iy)\\^ = l. 

Therefore, by the generalized Minkowski inequality (Akhiezer (1965), Sec. 5, p. 14), for any 
1 < p < 00, 

\\Ref{- + iy)\\p = ||n(-,y)||p < ||n||p, (|y| < 7). 

Assuming \\u\\p < 00, 1 < p < 00, we get for y ±7, cf. Stein (1970), Theorem 2b-c), 
pp. 62-63, 

u{x,y) — >■ u{x), a.e. 

and 

\H-,y) -u{-)\\p -> 0. 

These considerations lead to the following definition. For given 7 > 0, M > 0, and 1 < p < 00, 
denote by J"(7,p, M) the class of functions f{x), x G R, satisfying the following conditions 

a) / admits representation (6), in which 

b) \\u\\p < M. 

Note that such functions are real on the real line, admit analytic continuation in the strip 
= {z : \Iiaz\ < 7}, and for almost all x have limiting values u{x) at the boundary 
llmzj = 7. 

Remark 1. In the case of 27: -periodic analytic functions f{z), which are real on the real axis, 
the representation (6) holds if and only if (see Sarason (1965), Wilderotter (1996)) 

< 00. 



sup I / \u{x, y)f dx I 
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It seems plausible that in our case a similar role is played by the condition 



sup ||u(-,7/)||p < OO. 
|j/|<7 



2.2 The best harmonic approximation of classes J^(7,p, M), 1 < p < oo. 

Recall that an entire function g{z), z G C, is said to be the entire function of exponential 
type N if for any e > there is a positive constant A = such that \g{z)\ < ^e^^+^^l^l for 
all z G C. Denote by £j\f the class of entire functions g{x) of exponential type such that 
g G Li. It is known that such functions arc necessarily bounded on R (see Akhiezer (1965), 
Sec. 83). Hence, they belong to Lp, for any p > 1. The Fourier transform of g & 6^ will be 
denoted by g: 

g(t) = [ gix)e-'^''dx, t G R. 

By the Wiener-Paley Theorem (see Akhiezer (1965), Sec. 82) the Fourier transform g[t) of any 
function g E £n vanishes outside the interval [—N,N]. 

The problem of the best approximation in Lp of functions / G J-{'~f,p,M) by functions 
g G £n is discussed in Akhiezer (1965), Ch. IV-V. Further references can be found therein. 
The importance of such approximations in nonparametric density estimation has been demon- 
strated in Ibragimov and Hasminskii (1980). 

A starting point in finding such an approximation is to find a best harmonic approximation 
Gn G £n in Li for the function G{x) appearing in (6). Here we cite the following result from 
Akhiezer (1965), Sec. 88, p. 207, and Sec. 110, p. 268. There exists a function Gn G £n such 
that 

||G-G^||i= inf ||G-<7||i = -y— , -— <-e-^^. 

" " sefjv ' ' TT ^ (2A; + 1) cosh(2A; + l)iV7 tt 

Note that since UGatHi < oo, the function /]^(x) = (Gn * u){x) is in L^. Moreover, Akhiezer 
(1965) shows (sec Sec. 97, p. 228; and Sec. 100, p. 237) that /j^ G £n- Thus another application 
of the generalized Minkowski inequality gives, uniformly over J^{^,p, M), 

11/ - fMp = \\iG- Gn) * u\\p < \\G - Gn\\i\\u\\p < ^e-^^. 



This is of course a merely existence result, as f^ cannot be viewed as 'constructive' approx- 
imation. Indeed, finding f^ involves knowing u, which in turn requires solving the integral 
equation (6). This is known as incorrect problem in the Theory of Integral Equations. Never- 
theless, the very existence of f^ with the above properties can be used to produce a 'construc- 
tive' approximation to function / G T{'y,p,M) which is nearly as good as its best harmonic 
approximation. 



2.3 Multipliers 

Consider the class jv of functions g such that 



a) ge Li(R); 

b) g{t) = 1, for \t\ < N. 
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Such functions we will call multipliers, in a slight variation of how this term is defined in the 
Theory of Approximation. Multipliers produce constructive approximation to a given function 
/ in any Lp, 1 < p < oo, in the form g * f, which is almost as good as the best harmonic 
approximation of /. Indeed, using the best harmonic approximation from the above, and 
noting that by the Convolution Theorem g * = /]^, we obtain, by another application of 
the generalized Minkowski inequality, that for any g G Mi\f, uniformly over J^('j,p, M), 

\\g*f-f\\p = \\g*{f-fN + rN)-f\\p = \\g*{f-rN) + {rN-mp 

< \\g * if - fmp + m - fWp < (1 + \\g\\i)\\rN - fWp 

< 8^(1 + Nil) ,-.iv. (7) 

TT 

In other words, an application of a multiplier to a function / affects the rate of the best 
harmonic approximation only by a factor (1 + H^Hi). 



2.4 A class of density functions 

Now we introduce the functional class of interest. When estimating density function using 
Lp-risk, we will distinguish between the two cases: 1 < p < 2 and 2 < p < oo. The case of 
1 < p < 2 requires separate consideration and more severe assumptions on the underlying class 
of functions. Specifically, let 7 > and M > be given numbers, and let for 2 < p < 00 



F(7,p, M) = {/ : / is a density on R and / G J^{j,p, M) n Lp/2}, 
and for 1 < p < 2 and some (2 — < A < 2 

F(7,p,M) = {/ : / is a density on R, / G J^{'y,p,M) and / \x\^f{x)dx < 00}. 

By Lemma 1 of Mason (2009), when 1 < p < 2 any function / G F(7,p, M) is in Also, 
for all 1 < p < 00 any function / G F(7,p, M) is a bounded function that belongs to Lp. 
Indeed, by (6) and Holder's inequality for any / G F(7,p, M) and all a; G R 



1/(^)1 < [ \G{x-s)u{s)\ds<\\G\\ 
Jn 



^ =:Ci,,P,M)<oo, 

and 

IX < 00. 



\f{x)\Pdx < max / \f{x)\P/^da 



Now let T(7,p, M) be the topology on ¥{'y,p,M) induced by the distance 

11^/ - '^gWp + Ir \x\'^\fix) - g{x)\dx, l<p<2, 
\\uf - UgWp + \\f - g\\p/2, 2<p<oo, 

where A is the same as in the definition of F(7,p, M) and 



Pp{f,g) 



Uf{x)= lim Iief{x + iy), Ug{x) = lim Keg{x + iy). 
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This is a strong topology in the sense that closeness with respect to pp implies, via (6) and 
Holder's inequality, closeness in the uniform topology. Also, by the inverse triangular inequal- 
ity, closeness with respect to pp implies closeness of the Lp/2-norms, 2 < p < oo. With respect 
to the topology T(7,p, M), we have 

(Al) for all 1 < p < 2 and some (2— < A < 2 as above, locally uniformly in / G F(7,p, M), 
the integral J|j.|>s \x\^f{x)dx converges to zero as S — >■ oo; 

(A2) for 2 < p < oo, locally uniformly in / G F(7,p, M), the integral J\y.\^B f^^"^^^)*^^ 
verges to zero as S — >■ oo . 

Remark 2. For 1 < p < 2, due to (Al) and Lemma 1 of Mason (2009), locally uniformly in 
f G ¥(-f,p,M), the integral Jm^b f^^^i^)*^^ converges to zero as B ^ oo. 



3 Kernel-type estimators 

In this section, we assume that X, Xi, X2, ■ ■ ■ are independent random variables with common 
density function / G F(7,p, M), 1 < p < 00. 

3.1 Construction of the kernel 

To estimate analytic density with Lp-risk, we will be using a kernel-type estimator of the form 



1 " 

fn{x) = -^g{x - Xi), xeK, 



n 

i=l 

where g G Mn (see Section 3.1 for the definition of Mn)- Wc will sec that the systematic 
error (the bias term) of a such estimator is completely determined by \\g* f — f\\p, whereas its 
stochastic term is determined by 

ll5l|2||/||p/2- 

Note that by the Parseval inequality for any g e Mn, 

\\gg = {27ry'\\g\\l>N/7r. 

Thus our goal is to find a multiplier 5 G A^at whose L2-norm is 'close' to the lowest possible 
bound sJN/t: and which, at the same time, has a reasonably well-behaved Li-norm In 
other words, the question is - informally - how to make \\g\\2 and H^Hi both small? For this, 
we will consider a classical family of kernels well known in Approximation Theory: 

, , cos 6'a: - cos a; ^ ^ ^ 

7r(l — 6)x^ 

In Akhiezer (1965) they are called Fejer-type kernels. For = 1 we obtain, as a limiting case, 
the sine function: 

k(x) = k(x; 1) = . 

TTX 

As other special cases we obtain the Fejer {6 = 0) and the Vallee Poussin {6 = 1/2) kernels 
(see the Table below). 
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k{x) 


9 


Kernel 


2sin-f 





Fejer 


2(cos(a;/2)— cos x) 


1/2 


Vallee-Poussin 




smg; 
-7r.x 


1 


sine function 



The Fourier transforms of these kernels are given by 

k{t; 0) = m <e) + < \t\ < 1). (8) 

We have 

l|A:(-;^)lli = ^ 

and, cf. Akhiezer (1965), Sec. 106, p. 255, 

^logii| + i<IW.;«)||,<±logi±| + 2. 

Using the above family of kernels, it is easy to construct, for any < < 1, a multiplier 

A: AT G AT as 

N f Nx \ 

Note that by the above formulas 

||M-;e)lli = l|fc(-;^)l|i<^iog[^ + 2 

and 

f < \\kNi-,e)\\l = j\\k{-;9)\\l = ^(l + ^y (9) 

Now, a meaningful choice of the kernel from the above family is obvious, at least when N 
becomes large. To achieve an asymptotic equality between left and right sides in (9) we choose 
9 = 9n 1 as iV — >■ oo. For instance, if we choose 0jv = 1 — c > 0, we get 

\\kN{-,e)\\l = ^ + o{i) 

and, at the same time, 

\\kNi-,e)\\i = \\ki-;9)h=0{logN). 

Definition. We call a sequence of kernels k^ix) € Mn an asymptotic {£n i^ij-fnultiplier, if 

a) fcjv(t) = 1 for \t\ < N, 

b) f llfeivlli^l, asiV^oo, 

c) ||/cjv||i = 0(log AT), as AT ^ oo. 

Remark 3. Our definition catches some of the characteristic features of multipliers, as they 
are defined in Approximation Theory, but is somewhat different by taking into account the 
stochastic nature of nonparametric estimation. Neither of the classical kernels are multipliers, 
in the above sense. The sine kernel satisfies a) and b), with an exact equality \\k\\2 = N/tt, 
but is not in Li. The Fejer kernel does not satisfy a) whereas the Vallee-Poussin kernel does 
not satisfy b). 
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3.2 Definition of the kernel-type estimator 

Following the discussion of the previous subsection, we attempt to estimate / by means of the 
kernel-type estimator 

1 " 

fn{x) = -y2kh^{x-Xi), xeK, (10) 
1=1 

where kh^{x) = hn~^kn {hn^x) is the scaled Fejer-type kernel based on (see Section 3.1) 



cos(6'„a:) - cos(a:;) 
= .(l-g.)x^ ' ^''^ 



with 

A remarkable feature of our estimator fn{x) is that the sequence kh„{x) is an asymptotic 
(£^jV)Li)-multiplier (with N = ^^r)- To see this, observe that (see Section 3.1) 

kn{x) = kni-x), / kn{x)dx = l, max /s„(a;) = fe„(0) = -; (12) 
kn{t) = l, for|t|<0„; (13) 
ll^n||i = ^ = - + 0(A^-^), n^oo; (14) 

^ 1^ + 3 ^ 11^-11^ 1^ + 2' 



and hence 

Therefore 
and as n — >■ oo 



\kn\\i = 0{logN), ra->oo. (15) 
fcft„(i) = l for |i|<iV, 



j;^\\khJl^l, ||A:;,J|i = 0(logiV). (16) 

In subsequent discussion, the fact that kh„{x) is an asymptotic (iSjv, Li)-multiplier will allow 
us to apply inequality (7) for establishing some useful properties of this function. 
Note also that the parameters hn, On and N are chosen to satisfy 



/i„ — >■ and nhn — >■ oo as n — >■ oo. 
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3.3 Some properties of the scaled kernel 

In this section, we establish further asymptotic properties of the scaled kernel 
We have 

maxkh„{x) = kh„iO) = (17) 
and, uniformly in / G F(7,p, M), as ra ^ oo 

Bfkh^x -X) = fix) + 0{n-^/^ log N), (18) 
Efkl{x-X) = M + 0{logN). (19) 

TTlTji 

Equality (17) is obvious. Let us prove (18) and (19). We get from (7), (15), and (17) that, 
uniformly over F(7,p, M), 

sup\Bfkh^ix-X)-fix)\ = ||fc;,„./-/|U< ^^'^^^^"^'"""^^ e-^^ 

= 0{n-^/'^ log N), 

which yields (18). 

The proof of (19) is a more dehcate problem. For given / G F(7,j3, M) and x G R, consider 
the function 

khn{x-t)f{t), ten, 

and observe that 

Efklix -X)= I klix - t)fm = ikn„ * [knAx - •)/(•)]) (^)- 

In order to prove (19) we shall apply inequality (7). We can do this because kh^{x) is an 
asymptotic (£^jv, Li)-multiplicr (sec Section 3.2 for details), and for any / G F(7,p, M) and all 
a; G R, the function kh„{x — t)f{t) belongs (as a function of t) to the class F(7,p, M„), with a 
constant M„ = 0(n^/^), as n ^ oo. 

The latter fact is easy to verify. Indeed, for any a; G R, 7 > 0, and all 1 < p < 00, 
uniformly over F(7,p, M), 



||Re kh^ (x - (• ± i7))/(- ± i7) lip < M sup \kh^ {x-{t± 

MNhn I cos (Nx - Nit ± i'y)) - cos ih-^x - h-^(t ± ij)) \ 
■ sup ' ' 



TT teR |x-(t±i7)|2 
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Therefore, noting that when t = x one has 

fix) 



we infer from (7) and (16) that, uniformly over F(7,p, M), 



sup|E/feI„(x-X)-(7r/i„)-V(x)| 

= sup \{kh^ * [kh^x - •)/(•)]) {x) - khSx - x)f{x) 
xeB. 

<^!^2!!^^!il±«Mlle-- = 0(logiV), 

TT 



which yields (19). 



4 Main result 

The main result of the paper deals with an upper bound on the local maximal Lp-risk of our 
estimator /„ of density / G ¥{'j,p,M), and provides a partial solution to the conjecture of 
Guerre and Tsybakov (see relation (2)). Before stating the result, for 1 < p < oo put 

Pp = 7r-'/Wj;,M„ M, = (E|Ar(0, l)^)^^' = ^2 (-^F (^)) , (20) 
and define the rate function by 

V'p(n) = (n/i„)-V2/3p. (21) 

Denote by £ = JC{A,B) a class of loss functions that consists of non-decreasing functions 
i : [0, cx)) — )■ R such that Z(0) = 0, l{x) is continuous at a: = 1, and for some positive constants 
A and B, 

Z(x) < Ae-^l^l, xG[0,oo). (22) 

We have the following theorem. 

Theorem 1. Let Xi, X2, . . ■ be a sequence of i.i.d. random variables with density function 
fo G F(7,p, M), and let fn be an estimator of fo given by (10). Then for any I G C and any 
1 < p < 00, 

lim limsupsupE/Z(V'p ^(n)||/„ - /||p) < /(I), 
v\/o fev 

where V is an arbitrary vicinity of fo in the topology T(7,p, M). 
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5 Proof of Theorem 1 

Let / G F(7,p5 M) and let fn{x) be an estimator of f{x) given by (10). Consider the variance- 
bias decomposition 

fn{x)-fix) = {Uix)-BfU{x)) + iEfUix)-fix)) 

= {nhnr^^^^n{x) + bn{x), X E R, (23) 

where ^n{x) = ^n{x,Xi, . . . ,X„) is an infinitely differentiable zero-mean stochastic term and 
bn{x) is a bias term defined as follows: 

Ux) = - 5^(fcft„(x-X,)-E;/cft„(a;-Xi)), 

^'^ ^ i=i 

bn{x) = BfkhJx-Xi)-f{x). 

The proof of Theorem 1 is based on the following four lemmas proved in Section 7. 

Lemma 1. For any 1 < p < oo, locally uniformly in f G F(7,p, M), 

where Pp is given by (20). 

Lemma 2. For any 1 < p < oo, locally uniformly in f & ¥{'y,p, M), 

Var/ ^ 0, n^oo. 

The next lemma deals with the bias term 6„. 

Lemma 3. For any 1 < p < oo, locally uniformly in f & F(7,p, M), 

\\bn\\p = o(E/||/n - E//n||p) ■ 

The fourth lemma ensures the uniform integrability of the sequence {KV'^^(^)ll/n ~ /lip)} 
uniformly in a vicinity of /o G ¥{'y,p, M). 

Lemma 4. For any I G C and 1 < p < oo, there exists r > such that, locally uniformly in 

fe¥{-f,p,M), 

limsupE//i+^(V'-'(n)||/„ - f\\p) < oo. 

n— >-oo 

Proof of Theorem 1. Lemmas 1 and 2 in conjunction with Chebyshev's inequality imply, 
locally uniformly in / G F(7,p, M), 
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iP;\n)\\fn-Bff4p^l, 



Since l{x) is continuous at a; = 1, the combination of this relation with Lemmas 3 and 4 yields 
via (23) the conclusion of Theorem 1. □ 



6 Proofs of Lemmas 

We shall first formulate several preliminary results that will be used in the proofs of Lemmas 
1-4. 

6.1 Propositions 

In the next section, under the conditions of Theorem 1, we shall prove three technical propo- 
sitions connected to the stochastic error ^„(x). 

Proposition 1. Let (,nix) be the stochastic term in decomposition (23), where Xi,X2 ■ ■ ■ , is 
a sequence of i.i.d. random variables with density function f G F(7,p, M). For any p > 2 we 
have for some constant Lp > 0, and all r > 1 and n> 1, 



E/ii6.ii;< 



rLr. 



+ logr 



(n/i; 



r/p 



r/p 



+ 



+ 



For p = 2 we have for some constant L2 > 0, and all r > 1 and n > 1, 



EfUn\\l<(j 



rL2 



+ logr 



+ 



hn 



(24) 



(25) 



Moreover, for a given 1 < p < 2 and some (2 — p)/p < X < 2, we have for some constant 
Lp > and all r > 1 and n> 1, 



CVf (A,p) (1 + 2^-1 (E|X|^ + /i^E|y„|^) 



r/p 



+ 



{nhnf 



(26) 



where X has density f{x) and has density k^{x)/\\kn\\2- 

An immediate consequence of Proposition 1 is the following corollary. 

Corollary 1. For any 1 < p < 00 and t > 0, whenever the conditions of Theorem 1 hold, 
locally uniformly in f E ¥{'y,p, M), 



limsupE/exp (ty^nh^\\fn - BffnWp) 



< 00. 
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In terms of the process $,n{x) we have for every t > 0, locahy uniformly in / G ¥{'y,p, M), 

UmsupE/exp < oo, 

n— J-oo 

Proof of Corollary 1. Observe that for 2 < p < oo the norm ||/||p/2 of / G V can be made 
as close to ||/o||p/2 desired, since a vicinity V of /o can be made arbitrarily small. Similarly, 
for 1 < p < 2 the expectation EjjXl'^ can be made as close to Ejq|X|'^ as desired. Also, for 
all 1 < p < 2 and all (2 - < A < 2, as n ^ oo 



.A^i..iA_ 8^n r (^(1 + ^n)/2) sin^ {y (1 - en) /2) 

( f , ^ r\ sin^ {y{l + en)/2) sin^ {y (1 - 0n) /2) 



1 (y(l + 0„)/2)H!jil e„:)/if , ^ / /■'^"^ (y(l - ^^.)/2)2 



+0 /"~ = 0(/i^) + 0(/i„) + 0{K) = o (1) . (27) 

Therefore, whenever the conditions of Theorem 1 hold, for any 1 < p < oo and any sufficiently 
small vicinity V of /o, we have, using Proposition 1, that for some constant ^4 > 0, and for all 
r > 1 and all sufficiently large n, 



{nhnY/^supBf\\U--Effn\\;< (- 
/ev V 1 



rA 1 
1 + 



+ logry V {nKy/"^)' 

From this using Stirling's approximation, which says for r > 1 that r! > (r/ey', and the 
assumption nhn oo, for sufficiently large n. 



supE/exp (tinhnf^'^Wfn - E//n||p) 
fev ^ ' 

^ (l + logr) 



□ 



Proposition 2. Under the conditions of Theorem 1, for all p > 1 and r > 1, for any e > 
and any sufficiently small vicinity V of fo, there exist n* > and B > such that for all 
n>n* and all f eV 




In order to state the next proposition, we need a concept of the uniform weak convergence 
(see, for example, Ibragimov and Hasminskii (1981), p. 365.) 
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Definition. Let rjg and rj^e, n = 1,2, ... , be random fc-vectors with respective distributions 
Pg and Pn6», n = 1,2,..., that depend on a parameter 6 G Q. We say that rjnd converges weakly 
to rjQ uniformly in 9 E Q, it for any continuous bounded function g : R*^ — > R, uniformly over 
G, as n — >■ oo 

/ g{x)dPne{x) g{x)dPe{x) or Eeg{r]ne) ^ 'Eeg{ve)- 

In what follows, we think of vectors in R*^ as row vectors. The next statement is a uniform 
version of the Central Limit Theorem (CLT) whose proof is similar to that of Theorem 15 in 
Appendix I of Ibragimov and Hasminskii (1981). 

Fact 1. Let Ci,n, • • • , ^n,n; n = 1,2,..., be a sequence of series of random k-vectors that 
are independent within each series, and such that the distribution Pf„ of $^i^n, i = l,...,n., 
depends on a parameter ^ Q. Let Eg^j^„ = G R'^, = X^iLi^*."' ^''^^ '^d\^i,n\'^ < oO; 
where = (a;, a;) is the square of the norm of x . Put 

n 
i=l 

If a'^{0) — )• a'^{0) uniformly in E @ and the Lindeberg condition holds uniformly over @, that 
is, for any r > 

n 

supVEe(|ei,n|'l(|ei,n| >r)) ^0, n^oo, (28) 
then uniformly in G Q 

Cn^N{0,a\e)), n^oo. 

We will apply the preceding fact to the sequence {S,nix) , S,niy)) , n = 1, 2, ... , where S,n{x) is 
the stochastic error of fn{x) in the decomposition (23) and x ^ y. To this end, for n = 1,2,... 
and i = 1 . . . ,n, put 

'ni,n{x, y) = im,n{x), r]i,n{y)), (29) 

where 

Vi,n{x) = [khnix - Xi) - Efkh„{x - Xi)] , xeR, 

and observe that 

n 

{Cn{x),£.n{y)) =^'ninix,y), n = 1,2, .... 
i=l 

Now fix (5 G (0, 1) and consider the region 

Dn = Dn{S) = {{x,y)eB:':\x-y\>N-('-'y'}, N =^^. (30) 

When {x,y) G uniformly in / G ¥{'~f,p,M), the components of {^n{x),^niy) are weakly 
correlated and for all large enough n the uniform CLT applies. 
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Proposition 3. Under the conditions of Theorem 1, for all p > 1 and any e > there exists 
n* > such that for all n > n* , any continuous bounded function g : — )• R, and all 

fe¥{j,p,M), 

sup \Bfg{{Ux),Uy))) - ^f9iC{x,y))\ < e, (31) 

(x,y)eDn 

where C,{x^y) is a mean zero normal random vector with covariance matrix Cov f{(^{x,y)) = 
Diag(7r-V(x),7r-V(y))- 

6.2 Proofs of Lemmas 1—4 

The proofs of Lemmas 1-4 are largely based on Propositions 1-3. 

Proof of Lemma 1. First, we show that for any p > I, uniformly over a small vicinity V of 
/o, as n — >■ oo 



TT 



-l^Ml, xGR, (32) 



where ^ is a standard normal random variable and Mp = (E|A/'(0, 1)1^')^^*'. Indeed, by the CLT 
and the continuity theorem, as n — >■ oo 

Therefore, in order to prove (32) it is sufficient to show that there exists r > such that 

lim limsupsupE/ (|Cn(a;)|^)^+^ < c < oo. (33) 

Note that for any p > 0, 

< cel^l < c(e^ + e""^), a; G R. 
Thus (32) holds provided for a sufficiently small vicinity V of /o 

limsupsupE/e='=^"^^^ < c < oo. (34) 

n— >oo /eV 

The proofs of both cases in (34) are similar. Let us consider the case of the plus sign. By 
independence 

E^expU„(a;)} = |^Ej exp | (fe;,„ (x - X{) - Efk^Ax - Xi))|^ . 
Using Tailor's expansion, formulas (17)-(19), and the fact that / G ¥{'y,p,M) is a bounded 
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function, for some C > and all sufficiently large n, uniformly in / G ¥{'y,p, M), 

E/ exp I \[^{kh^ (x - Xi) - Bfkh„ (x - Xi)) I 



C_ 1 ^ (Clogn)j/2 ^ C 
2n ' ^ ni/2(logny/2 7! ~ ^ 2n ^ n ^ n(i-2)/2 7! - 



C ^ (Clogny 



Therefore for any p > 0, uniformly in / G F(7,p, M), 



limsupE/|^„(x)|^ < (2c)limsup 1 + 



n 



const < oo, 



(35) 



and relation (34), and hence relation (33), follows. 

Next, by conditions (Al) and (A2), Remark 2, and Proposition 2, for any £ > and any 

sufficiently small vicinity V of /o, there exist numbers n* > and -B > such that for all 
n > n* and all / G V the inequalities 



sup / fP/'^{x)dx < £ and sup / E/|^„(x)|Pdx 
}eVJ\x\>B fevJ\x\>B 



< £ 



>\x\>B /eVJ|a:|; 

hold simultaneously. Thus, we only need to show that, locally uniformly in / G F(7,p, M), 

p 



B 



B 



E 



fix] 



TT 



dx = P^, n oo. 



Using the Dominated Convergence Theorem, we get from (32) and (35) that the above limiting 
relation holds true. The proof of Lemma 1 is completed. □ 



Proof of Lemma 2. In the proof, the key role belongs to Proposition 2. From Proposition 
2, for any e > and any sufficiently small vicinity V of /o, there exist n* > and B > such 
that for all n>n*, 



sup 

/ev 

Due to the identity 



Var; ( [ U^Wdx] - Var; f / lU^^dx 

\Jr J \j\x\<B . 



< £. 



Var / 



\x\<B 



|en(x)Rx 



// Cov^ (|4n(x)r, \Uyr) dxdy, (36) 

JJ[-B,B]^ 



the problem is reduced to showing that 



hm hm sup / / Covf HU^W, lUvW) dxdy = 0. 

V\/0 "->00 Jgy J y[_B,B]2 
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As in Proposition 3, consider the region 

D„ = {{x,y) ^B?:\x-y\> N'^^-^y^ iV = 
with 8 G (0, 1) being fixed, and let 1?^ be its complement (in R^). Since 

I [-B, Bf n Dl\ = 0(Ar-(i-'5)/2)^ 

it follows from (36) and Proposition 1 that 

lim lim sup / / Gov/ . \Uy)\'') dxdy = 0, 

V\/o n^oo jgy J J[-B,B]^nD^ 

and it remains to show that 

hm lim sup / / Gov/ . \Uy)\n dxdy = 0. (37) 

V\/o n^oo J J[-B,B]2nD„ 

In order to prove (37) we shall use Proposition 3 and the fact that for 1 < p < 00, 
uniformly in / G F(7,p, M), the sequences {|^n(3;)|^} and {\^,nix)£,niy)\^} are uniformly in- 
tegrable. The uniform integrability of {|Cn(a^)|^} has been already verified (see (35)). The 
sequence {\^n{x)Cn{y)\^} is treated similarly. For any p > and all x, y G R 

^f\Cn{x)^n{yW < cE/(e«"(^)^"(^) +e-^"(^)^"(^)). 
For the plus sign, by independence 

E/exp{^„(x)^n(y)} 

= {■Efe^p{{n-^hn)[khAx - X,) - E^fc^Jx - Xi)][fc^„(y - X,) - Efk^^y - Xi)]})" . 

Then, using (17)-(19), for some C > and all sufficiently large n, uniformly in / G F(7,p, M), 
Efexp{in-^K)[kh„ix - Xi) - Efkh„ix - X,)][kh„iy - ^1) - ^fkhM - ^1)]} 

(00 
E 71 {in-'hn)[khAx - Xi) - EfkhAx - X,)][khAy ' Xi) - Efkn^y - ^1)])' 

, C ^ (Clogn)2i C l^iClognV C 

<l-\ 1-> -^^ ■ =1-1 — . ^ < 1 H . 

2n n-?(logn)-?7'! 2n n n^~^i\ n 

Hence for any 1 < p < 00, uniformly in / G ¥{'y,p, M), 

/ C\" 

limsupE/|^„(x)^„(y)|^ < (2c)limsup I 1 H J = const < 00, 

and the uniform integrability of the sequence {|Cn(3;)^n(y)|^} follows. 

Therefore, by Proposition 3 and the continuity theorem we readily conclude that 

lini lim sup sup Gov/(|^„(a;)|P, |^n(?/)|P) = 0, 
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which yields (37). The proof is completed. □ 



Proof of Lemma 3. It was shown in Section 3.2 that the scaled kernel 

khnix) = h~^kn{h~^x), 

with kn{x) given by (11) and hn = (l — AT"^) , is an asymptotic (fjv, Li)-inultiplier. 
Therefore it follows from (7) and (15) that for any 1 < p < oo, uniformly in / G F(7,p, M), 

||6„(-)||p = \\^fkhA--X)-f{.)\\p = \\kh^*fi.)-f{.)\\p 

< + e-7iv ^ o^iog ^)g_7iv ^ o (n- V2 log n) . 

At the same time, due to Lemma 1, locally uniformly in / G F(7,p, M), 

E/||/n-E/„||p = o(n-V2jvV2^. 

Hence, locally uniformly in / G F(7,p, M), 

ll^nllp = o(E/||/n - 'EffnWp) , 

and the lemma is proved. □ 

Proof of Lemma 4. In view of condition (22), the lemma is an immediate consequence of 
Corollary 1. □. 

7 Proofs of Propositions 

Proof of Proposition 1. Proposition 1 is a modification of Proposition 2 of Mason (2009). 
The latter deals with the kernel estimator 

1 A^./x-x- 



where ii' is a fixed kernel function such that the least decreasing radial majorant of Er^/||K||| 
defined by ^'(x) = sup if^(j/)/||if|||, x G R, is integrable. Mason (2009) suggested a powerful 

|y|>l^l 

method for deriving good asymptotic bound for 

E / |/„(x) - E/„(x)|^'dx, 1 < p < oo, 
J A 

with /„ given by (38) and A being a measurable subset of R. Moreover, he provided a useful 
finite sample bound for 

E||/n-E/„||;, l<p<oo, r>l, 
see also Lemma 4 of Ibragimov and Hasminskii (1980). 
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Unlike Mason, we allow our kernel kn{x) to change with n rather than being fixed. Due to 
properties (12)-(15), this, however, does not effect the method. The integrability of 

^n{x)= sup /s^(y)/||/cn||i, x G R, n = 1, 2, . . . , 

\y\>\A 

where fe„ is the Fejer-type kernel given by (11) is obvious. 

When applying Proposition 2 of Mason (2009) to ^n{x)-, we use the fact that by (12) and 
(14), for any p > 2, 

WKWl < maK\kn{x)r^ J^kl{x)dx = fer^(0)||fcn||i = < (39) 

Note also that for 1 < p < 2, the method of Mason requires the existence of a constant 
A > {2 —p)/p such that 

EIXI^^ < oo, E\Yn\^ < oo, 

where the random variables X and Yn have densities f{x) and A;^(aj)/||fcn||2) respectively. In 
our case, the existence of such A follows from the definition of the class ¥{'y,p, M) and the fact 
that for any < A < 2 and n > 1, cf. Gradshtein and Ryzhik (2000), 3.828.9, 3.828.10, and 
(27), 

When 1 < p < 2 the set {A : (2 — p)/p < A < 2} is non-empty, so that the required constant 
A > (2 — p)/p, for which Ejl^J'^ < oo, does exist. Therefore, for the proof of Proposition 1 we 
refer to Mason (2009), pp. 75-78. 

Proof of Proposition 2. In view of conditions (Al) and (A2), it suffices to show that 
for any S > 0, all p > 2 and r > 1, we can find a constant Cp^r such that for any function 
/GF(7,p,M), 

limsupE^f/ \^nix)\Pdx^ <Cp,r( [ F/\x)dx^ , (41) 

n-S-oo \J\x\>B J \J\x\>B J 

and for any i?>0, alll<p<2 and r > 1, we can find a constant Dp^^ such that for any 
function / G ¥{'y,p,M), 

limsupE/f/ \^n{xWdx^ <Dp,r( [ il + \x\^)fix)dx^ . (42) 

n->oo \J\x\>B J \J\x\>B J 

Inequalities (41) and (42) will be obtained by modifying the arguments in the proof of Propo- 
sition 1 of Mason (2009). One of the key elements of that proof is the following 

Fact 2. (Theorem 1 of Talagrand (1989).) If B is a separable Banach space with norm 
||-||, Zi, i € N, arc independent mean zero random vectors taking values in B, then for a 
universal constant D > 0, for all r > 1 and n > 1, 

(E||5„|r)^/'- < (e||5„|| + (e max \\Z,\A] , (43) 

l + logr \ \ l<i<n J I 
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where Sn = Zi + ■ ■ ■ + Z^- Using the Cr-inequaUty, we get from (43) the bound 

"^ii^-ii' ^ WW {ins4r+B^^^jz4') . (44) 

Adopting Mason's arguments to our needs, we apply the bound (44) to the random func- 
tions 

Zf {x) = {nKr'S^K (^-^) - E/^n (^y^) } >B}, xe-R,i = l,...,n, 

where i? is a positive constant. Because of the properties of kn (see Section 3.2), every Zf [x) 
is in Lp, for any p > 1. The random function defined by 

Snix) = Unix) - Bffn{x))I{\x\ > B}, XE R, 

can be now expressed as follows: 

S^{x) = Z^{x) + ... + Z^{x), xEK. 

Using Jensen's inequality, 

i/p 



BfWS^Wp = Bf\\{U-BfU)I{\-\>B}\\, = BJ [ \U{x)-EfUix)\Pdx] 

\J\x\>B J 

< [Ef [ \Ux)-BUx)\Pdx] =(E;||5f||^)'/^ (45) 

Next, according to Mason (2009), pp. 71-72, for any / G F(7,p,M) and any S > 0, 

/ \j^{x)-^^Ux)fdx < ^ I dx( [ ^kl^)f{y)dy, p = 2, (46) 

J\x\>B ntln Jlx\>B \Jli l^n J 



if [ \Ux) - BfUx)\P dx < [ il ^kl f{y) dyX'^ dx 

J\x\>B [nhnJP/^ J\x\>B \Jr K \ K J J 



\p/2 

i\x\>B ' {nrinr^ J\x\>B 

E^ / \Ux) - BfUx)\P dx < [ {Kl * f{x)Y^' dx, l<p<2, (48) 

J\x\>B inhnY'^ J\x\>B 

where 

KUx) = h-'kUh-^x), xeR. 
Case la. 2 < p < oo. Prom (47) 
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f[ \fnix)--EfUx)\Pdx 

J\x\>B 



< 



Consider the integral on the right-hand side of (49) . Using the rough version of the c^-inequahty 
that says that for all r > 

\x + y\'<2^\xr + \yn, 

we get 

\P/2 

( / —k''n{ — ]fix-t)d 

\x\>B 



= / if r*" (r)f(''-t)dt+ f ^ki [-f] f(x-t) dt] 

J\x\>B \J\t\<B/2 i^n \llnj J\t\>B/2 \nn / j 

<2P/^ [ ([ —k'^(—^fix-t)dt] + 

J\x\>B \J\t\<B/2 K \hnj J 

+2^/2 / ( [ ^kl (f) f{x-t) dX^ =: T, + T,. 



p/2 

dx 



(50) 



The first integral in (50) is estimated as follows: 



\ p/2 



T,<2P/' [ F'\s)dsl! ^kl(f]dt 

J\s\>B/2 \jRi''n \nnj J 

= 2^/2 / fl\s)ds[\ kUx)dxY^' =2P/'\\knr2 [ F'\s)ds. 

J\s\>B/2 \Jr J J\s\>Bl2 



For the second integral we have 

p/2 



7r \j\t\>Bl2 l^n \iinj ) 

= 2P/2 [ fP/^{s)ds I [ kl {x) dx] 

Jr. \J\x\>B/2hn J 
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Therefore using (49) and (50) 



/ \fnix)-BUix)\Pdx 
J\x\>B 



< 



L \ 2"/'ll^n||^ / F^Hx) dx + 2^1^ [ r/\x) dx [ kl {x) dx 

I J\x\>B/2 Jr J\x\>B/2hn 



vl 



2*'Cp||A;„||p 



From this, due to (45) and using the rough version of the c^-inequahty, for any r > 1, 



(E/||5fy^< |E^|^^^^|/„(x)-E//„(x)rcix| <2'-/^{^ 



F^^{x)dx + 2P/^ [ fP/\x)dx I kl{x)dx 

J\x\>B JR J\x\>B/2h„ 



2PCp ^'''P 



X < 



)p/2 

p/2 ^ -"/P 



{nhn) 



p-i 



r/p 



r/p 



n||2| / r'''{x)dx 
\x\>B . 



/o V'^ ( f 9 2'^IP2'^Cl'^\\kn\\l 



RecalUng that, as n — >■ oo. 



we get 



and hence 



iV = 0(logn), 0„ = l-_ = O(l), hn = ^ = 0{log-^n), 



kl{x) 



COs{9nX) — COs(x) 



7r(l - en)x^ 



2 2Ar2 
< 



TT^X^' 



i 



\x\>B/2h„ 



k^ (x) dx < 



2N^ r 

J\x 



dx 



\x\>B/2hr, 



= 0{N''hl) = 0{hn). 



Now, by Fact 2 apphed to the random process S'^ = + . . . + Z^, we have, cf. (44), 



E/I|5nli;< 



(l + logrrv(^^"l^""l^)^ + ^^m""^^""^ 
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where, using Cr-inequality, 



= Ef max 

l<i<n 



|a;|>B 



nhr, 



E/max||Zf||; 



< 2''+^E max 

l<i<n \ .l\x\>B 



hfi 



hr. 



r/p 



dx 



r/p 



dx < 



We see then by (51)-(54) that for all sufficiently large n, 



2'+^K/^\\kn\\; 

[nhnY 



r/p 



(54) 



E;||5^||; = E; f / \Ux) - ^fUx)\^dx] 

\J\x\>B ) 
p,rnr-l„rc)2r/pc)Zr/2(^r/p { / r \'^^^ 



+ 



Next, by (39) for p > 2 



22r- 1 rr2''/PCl'^\\kn\\l 2^ 1 1 k 



+ 



and 



{l + \ogrY{nhn)^P-^'>''/P (1 + logr)'(n/i„)'' 



(n^„)(p-i)'-/?' 



(n/in)' 



Therefore for all p > 2, r > 1, and iJ > there exists a number n* > such that for all n> n* 

r/p 



E, 



< 



V ( / \fn{x) - ^ffnixWdx 
\J\x\>B 

[ fP/\x)dx] +c^K I F'\x)dx 



^p,r J. hn ^ Cp^r 



fP/^{x)dx 



r/p 



(n/i„)(P-i)'-/f (n/i„)" {nhnY/'^ \J\x\>B/2 
where ap^r, bp^r, Cp,r, and Cp^r are constants that may depend on p and r. Noticing that 

Ux) = {nKf'\Ux) - BfUix)) 



(55) 



yields (41). 
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Case lb. p = 2. In this case, the proof is analogous to that of Case la, but even easier. Using 
(46) we get 

/ \Ux) - ^fUx)\Hx f dxj ^kl (^] f{y) dy. 

J\x\>B ntln J\x\>B JR i^n \ lin J 

Then by analogy with (51) 



1/2' 



% I / \fn{x)--EfU{x)\'dx 

\x\>B . 



< 



L 



r/2 



llfenlli / f{x)dx+ [ rl\x)dx [ 
J\x\>B JR J\3 



r/2 



x\>B/2hn 



(x) dx 



From this, using (52)-(54), 



r^/2r>r-l„ 



(l + logr)- [nhnY/'' 
{1 + log rYinhnY 



||^n||2 



/ f{x) dx ) + Cjhn / f{x) dx 
\J\x\>B I Jr. 



Note that hl/^/{nhnY — >■ and, by (14), < tt ^. Therefore for all r > 1 there exists a 

number n* > such that for all n>n* and all / G F(7, 2, M), 



E 



([ \fn{x)-BfU{x)\^dx] < 
\J\x\>B J 



{nhnY^'^ \J\x\>B/2 



r/2 



f{x) dx 



(56) 



where Cr is a positive constant that may depend on r. Because of the relation (55) the bound 
(56) gives (41) when p = 2. 

Case 2. 1 < p < 2. We know that (see (53)) 



BfWS^W; = ^f([ \Ux)-EfUxWdx] 

\J\x\>B J 



where by (45) and (48) 



\J\x\>B 



r/p 
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and using (54) 



tjf max z,- ^ 



l<i<n 



Thus, for any 1 < p < 2 there exists a constant Lp > such that for aU r > 1, 



(nhn) 



r/2 



+ ^ 



(n/i„) 



(57) 



where 



Kti^) = h-'ki{h-'x), xeR 



Thanks to Lemma 1 of Mason (2009), for all n > 1, the integral on the right-hand side of (57) 
is bounded by 

/ «*/(a;))^/'da;<C(A,p)||fe„i{Ey(l + |X,J^)I(|X,J >5)r^'. (58) 

J\x\>B ^ 



Here 



^h„ = X + Yh„, 



where X has density /(x), Y/j„ has density -f^^^(a;)/||^n||2) ^i^d Y^^ is independent of X. 
Inequality (58) implies that 



hmsup/ {Kl*f{x)Y^^dx < C{\,p)\\knrJEf{l + \X\>^)Ii\X\>B)Y^^ 

n->oo J\x\>B 

,{X,p)( f {l + \x\^)fix)dx] . (59) 

\J\x\>B J 



< Ci( 



The proof of the upper line in (59) repeats that of relation (20) in Lemma 1 of Mason (2009) 
given for fixed kernel independent of n. We only need to show that for any {2—p)/p<X<2, 
cf. bound (24) in Mason (2009), 

E|y;,J^^0, n^oo, 
which is true in view of (27). Next for all 1 < p < 2 and all r > 1, 



i<"n W^ri 

{nhn) 



n — )■ oo. 



Therefore from the bounds (57) and (59) we get that for any r > 1 there exists a number 
n* > such that for all n> n* 



([ \Uix)-BfUx)\Pdx] <^I^([ {l + \x\^)f{x)d^ , 

\J\x\>B ) [nhnY'^ \J\x\>B J 
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with some constant Dp ^ > 0. In view of condition (Al), this yields by (55) the statement of 
Proposition 2 in the case 1 < p < 2. The proof is completed. □ 

Proof of Proposition 3. Note that by (18) and (19), for all x G R, uniformly in / G 
F(7,p,M), 

YsiYfinix) = l^ + 0{N-^ log N). (60) 

TT 

Now let Dn = Dn{5) be the region defined as in (30). When (x, y) G the random functions 
^n{x) and ^n(y) are weakly correlated and for all large enough n, uniformly in / G F(7,p, M), 

sup Covf{Ux),Uy)) = 0{N-^). (61) 

{x,y)eDn 

Indeed, keeping in mind the properties of / G ¥{'y,p,M) (see Section 2.4), using (16) and 
(18) we obtain that for any y G R, uniformly over F(7,p, M), 

sup \Cov f{kh^{x - Xi),khjy - Xi)) - khjy - x)f{x)\ 
< Whr. * [khM - ■)/(■)] - hr^iy - ■)f{-)\\oo + W^knA- - X,)\\oo 

< S^X^^g + II^^JIi) ,-.iv + Oil) = 0(log AT). 

TT 

Thus, for all x, y G R, uniformly over ¥{'j,p, M), 

Cov/(fe„„(x - X,),khAy - Xi)) = khAy - x)f{x) + O(logiV), (62) 

where for some constant C = C{'y,p,M) > 0, 

Nhnfix) C 

\khAy - x)fix)\ = — ^ |cos(iV(x - y)) - cos{h~^{x - y))\ < 

Tr{x — yj^ (x — yy 

The last inequality implies that for all large enough n, uniformly over F(7,p, M), 

sup \kn^{y~x)f{x)\ = 0{N^-'). 

{x,y)eD„ 

Therefore, noting that 

Cov/(^„(x),Cn(y)) = hnCovf{kh„ix - Xi),kh„iy - Xi)), 

where K = N'^l - N''^), and using (62) wc get (61). 

We next turn to the sequence {Cn{x), Cniy)) = EiLi Vi,n{x, v), n = 1,2, . . . , where rji^^ix, y) -- 
{ili,n{x) , ili,n{y)) is given by (29), and show that the Lindeberg condition 

lim sup nEf {\m,n\Wvi,n\ > r)) = 0, V r > 0, (63) 

{x,y)eD„ 
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holds uniformly in / € F(7,p, M). To this end observe that by (19) and Chebyshev's inequality, 
uniformly in x G R and / G F(7,p, M), 

P/ {{ViA^)? > tV2) = O(n-i). 

Therefore by (17) and (18) for all sufEciently large n and some constant C = C{'y,p,M) > 0, 
uniformly over F(7,p, M), 

sup nE/ (|T7i,npI(|T7i_„| > r)) 

<C{nK)-^ sup nP f {{r,n,i{x)f + {r]n,i{y)? > t'^) 
< 2C{nhn)-^ supnP/ {{m,n{^)? > ^V2) 
= 0{{nhn)-')=oil). 

This ensures the validity of (63). 

Finally, using Fact 1 we infer from (60), (61), and (63) that the statement of Proposition 
3 holds. □ 
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